Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Semi-automatic Ground Truth Generation for Chart Image Recognition

Identifieur interne : 001010 ( Main/Exploration ); précédent : 001009; suivant : 001011

Semi-automatic Ground Truth Generation for Chart Image Recognition

Auteurs : Li Yang [Singapour] ; Weihua Huang [Singapour] ; Lim Tan [Singapour]

Source :

RBID : ISTEX:E06B873A38619286D9CCF8A173943039F895847F

Abstract

Abstract: While research on scientific chart recognition is being carried out, there is no suitable standard that can be used to evaluate the overall performance of the chart recognition results. In this paper, a system for semi-automatic chart ground truth generation is introduced. Using the system, the user is able to extract multiple levels of ground truth data. The role of the user is to perform verification and correction and to input values where necessary. The system carries out automatic tasks such as text blocks detection and line detection etc. It can effectively reduce the time to generate ground truth data, comparing to full manual processing. We experimented the system using 115 images. The images and ground truth data generated are available to the public.

Url:
DOI: 10.1007/11669487_29


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Semi-automatic Ground Truth Generation for Chart Image Recognition</title>
<author>
<name sortKey="Yang, Li" sort="Yang, Li" uniqKey="Yang L" first="Li" last="Yang">Li Yang</name>
</author>
<author>
<name sortKey="Huang, Weihua" sort="Huang, Weihua" uniqKey="Huang W" first="Weihua" last="Huang">Weihua Huang</name>
</author>
<author>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:E06B873A38619286D9CCF8A173943039F895847F</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1007/11669487_29</idno>
<idno type="url">https://api.istex.fr/document/E06B873A38619286D9CCF8A173943039F895847F/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001B31</idno>
<idno type="wicri:Area/Istex/Curation">001A21</idno>
<idno type="wicri:Area/Istex/Checkpoint">000983</idno>
<idno type="wicri:doubleKey">0302-9743:2006:Yang L:semi:automatic:ground</idno>
<idno type="wicri:Area/Main/Merge">001027</idno>
<idno type="wicri:Area/Main/Curation">001010</idno>
<idno type="wicri:Area/Main/Exploration">001010</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Semi-automatic Ground Truth Generation for Chart Image Recognition</title>
<author>
<name sortKey="Yang, Li" sort="Yang, Li" uniqKey="Yang L" first="Li" last="Yang">Li Yang</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive 2, 117543</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
<author>
<name sortKey="Huang, Weihua" sort="Huang, Weihua" uniqKey="Huang W" first="Weihua" last="Huang">Weihua Huang</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive 2, 117543</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
<author>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive 2, 117543</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2006</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">E06B873A38619286D9CCF8A173943039F895847F</idno>
<idno type="DOI">10.1007/11669487_29</idno>
<idno type="ChapterID">29</idno>
<idno type="ChapterID">Chap29</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: While research on scientific chart recognition is being carried out, there is no suitable standard that can be used to evaluate the overall performance of the chart recognition results. In this paper, a system for semi-automatic chart ground truth generation is introduced. Using the system, the user is able to extract multiple levels of ground truth data. The role of the user is to perform verification and correction and to input values where necessary. The system carries out automatic tasks such as text blocks detection and line detection etc. It can effectively reduce the time to generate ground truth data, comparing to full manual processing. We experimented the system using 115 images. The images and ground truth data generated are available to the public.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Singapour</li>
</country>
<orgName>
<li>Université nationale de Singapour</li>
</orgName>
</list>
<tree>
<country name="Singapour">
<noRegion>
<name sortKey="Yang, Li" sort="Yang, Li" uniqKey="Yang L" first="Li" last="Yang">Li Yang</name>
</noRegion>
<name sortKey="Huang, Weihua" sort="Huang, Weihua" uniqKey="Huang W" first="Weihua" last="Huang">Weihua Huang</name>
<name sortKey="Huang, Weihua" sort="Huang, Weihua" uniqKey="Huang W" first="Weihua" last="Huang">Weihua Huang</name>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<name sortKey="Yang, Li" sort="Yang, Li" uniqKey="Yang L" first="Li" last="Yang">Li Yang</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001010 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001010 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:E06B873A38619286D9CCF8A173943039F895847F
   |texte=   Semi-automatic Ground Truth Generation for Chart Image Recognition
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024